ON THE MAXIMUM NUMBER OF LATIN TRANSVERSALS


ROMAN GLEBOV AND ZUR LURIA 


Abstract. Let $T(n)$ denote the maximal number of transversals in an order-$n$ Latin
square. Improving on the bounds obtained by McKay et al., Taranenko recently
proved that $T(n) \le \left((1 + o(1))\frac{n}{e^2}\right)^n$, and conjectured that this bound is tight.

We prove via a probabilistic construction that indeed $T(n) = \left((1 + o(1))\frac{n}{e^2}\right)^n$.
Until the present paper, no superexponential lower bound for $T(n)$ was known. We
also give a simpler proof of the upper bound.


1. Introduction 

An order-n Latin square is an n x n matrix L over the symbols [n] := {1,..., n} such 
that every symbol appears exactly once in every row and every column. A transversal 
of L is a set of n entries, one from every row and every column and one of every symbol. 
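The two definitions above can be made concrete with a brute-force count. The following is a minimal sketch, not part of the paper's argument: the helper names `cyclic` and `count_transversals` are hypothetical, symbols are 0-based rather than taken from $[n]$, and the enumeration over all permutations is only feasible for very small orders.

```python
from itertools import permutations

def cyclic(n):
    """Addition table of Z_n, a Latin square of order n (symbols 0..n-1)."""
    return [[(r + c) % n for c in range(n)] for r in range(n)]

def count_transversals(L):
    """Count transversals of L by brute force.

    A permutation sigma picks the cell (r, sigma[r]) in every row r, so rows and
    columns are automatically all different; the selection is a transversal
    exactly when the n selected symbols are also all different.
    """
    n = len(L)
    return sum(
        1
        for sigma in permutations(range(n))
        if len({L[r][sigma[r]] for r in range(n)}) == n
    )

# Transversal counts of the cyclic squares of orders 1..7.
counts = {n: count_transversals(cyclic(n)) for n in range(1, 8)}
print(counts)
```

The even orders in this small experiment have no transversals at all, which matches the remark later in the introduction that Latin squares of even order do not necessarily have transversals.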

Latin squares have been studied since ancient times, and the study of their transversals
also has a rich history, originating from the study of orthogonal Latin squares.
Two Latin squares are orthogonal to each other if each encodes a decomposition of
the other into disjoint transversals. Orthogonal pairs of Latin squares were studied by
Euler, who proved that for every $n \not\equiv 2 \pmod 4$, there exists a pair of orthogonal Latin
squares of order $n$.
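As an illustration of this definition of orthogonality, the sketch below checks that the cells carrying a fixed symbol in one square form a transversal of the other. The two order-5 squares and all function names are assumptions chosen for the example; they are not taken from the paper.

```python
def symbol_classes(L):
    """Map each symbol of L to the list of cells (row, col) where it occurs."""
    classes = {}
    for r, row in enumerate(L):
        for c, s in enumerate(row):
            classes.setdefault(s, []).append((r, c))
    return classes

def is_transversal(L, cells):
    """True if cells has one entry per row, per column and per symbol of L."""
    n = len(L)
    return (
        len(cells) == n
        and len({r for r, _ in cells}) == n
        and len({c for _, c in cells}) == n
        and len({L[r][c] for r, c in cells}) == n
    )

def orthogonal(L1, L2):
    """True if each square decomposes the other into disjoint transversals."""
    return all(
        is_transversal(L1, cells) for cells in symbol_classes(L2).values()
    ) and all(
        is_transversal(L2, cells) for cells in symbol_classes(L1).values()
    )

n = 5
L1 = [[(r + c) % n for c in range(n)] for r in range(n)]      # entry r + c mod 5
L2 = [[(r + 2 * c) % n for c in range(n)] for r in range(n)]  # entry r + 2c mod 5

print(orthogonal(L1, L2))  # the five symbol classes of L2 are transversals of L1
print(orthogonal(L1, L1))  # a square is never orthogonal to itself for n > 1
```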

Much subsequent work has considered Latin transversals in the context of orthogonality.
Aside from this, the literature on Latin transversals has mainly focused on two
types of questions - existence and enumeration. Regarding the existence question, it is
not hard to see that Latin squares of even order do not necessarily have transversals.
Ryser [16] conjectured that in every Latin square of order $n$, the number of transversals
is congruent to $n$ mod 2. Balasubramanian [2] proved Ryser's conjecture for Latin
squares of even order. However, it was observed (see, e.g., [?]) that counterexamples
to Ryser's conjecture exist for Latin squares of odd order. Subsequently, Ryser's
conjecture was weakened to the statement that every Latin square of odd order has a
transversal - which remains perhaps the most intriguing open question in the study of
transversals to date. For a survey on the current state of research on Latin transversals,
see [19].

Considering the enumeration problem, let T(L) denote the number of transversals 
in a given Latin square L. 

The order-$n$ cyclic Latin square over an alphabet $A$ is the addition table of a group
over $A$ isomorphic to $(\mathbb{Z}_n, +)$. In 1991, Vardi [18] conjectured the following.

Conjecture 1.1. Let $C_n$ denote an order-$n$ cyclic Latin square. Then there exist
positive constants $c_1$ and $c_2$ such that

$$c_1^n \, n! \le T(C_n) \le c_2^n \, n!.$$

Partial results on Conjecture 1.1 were obtained in [3], [?], [13], [6]. However, a lower
bound of the form conjectured above is still not known. Based on numerical evidence,
Cooper et al. [3] estimated that perhaps Conjecture 1.1 is true with $c_1 \approx c_2 \approx 0.39$.

Let $T(n)$ denote the maximum number of transversals in an order-$n$ Latin square.
McKay, McLeod and Wanless [?] showed that for constants $b \approx 1.719$ and $c \approx 0.614$,

$$b^n \le T(n) \le c^n \sqrt{n}\, n!.$$

Improving on this, Taranenko [?] recently proved a better upper bound. Before
stating her result, we shall need some notation.

Let $A$ be an order-$n$ $(d+1)$-dimensional array. An element of $A$ is a tuple
$(i_1, \dots, i_{d+1}) \in [n]^{d+1}$. A $\lambda$-element of $A$ is a tuple $(i_1, \dots, i_{d+1})$ such that
$A(i_1, \dots, i_{d+1}) = \lambda$. A line of $A$ is a set of elements obtained by fixing all but one
of the indices and allowing the free index to vary over $[n]$. A hyperplane of $A$ is a set of
elements obtained by fixing one index and allowing the remaining $d$ indices to vary over $[n]$.

An order-$n$ $d$-dimensional Latin hypercube is an $[n]^d$ array over $[n]$ such that every
symbol appears exactly once in every line. In the same way that a permutation may
be represented by a permutation matrix, an order-$n$ $d$-dimensional Latin hypercube is
equivalent to an $[n]^{d+1}$ 0-1 array with exactly one 1 in each line, and in what follows
we use this definition.

We generalize the notion of a transversal to the high dimensional case. A transversal
of $A$ is a set of $n$ 1-elements of $A$, exactly one from each hyperplane. In the case of Latin
squares, this is equivalent to our previous definition. Let $T(d, n)$ denote the maximum
number of transversals in an order-$n$ $d$-dimensional Latin hypercube.

Theorem 1.2 (Theorem 6.1 in [?]). For every $d \ge 2$ and $n \to \infty$,

$$T(d, n) \le \left((1 + o(1)) \frac{n^{d-1}}{e^d}\right)^n.$$

In particular, a Latin square has at most $\left((1 + o(1))\frac{n}{e^2}\right)^n$ transversals. This implies
the upper bound $T(C_n) \le \left((1 + o(1)) e^{-1}\right)^n n!$, slightly below the numerical estimate
$0.39^n \, n!$ of Cooper et al.
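The passage from the bound in terms of $n/e^2$ to the bound in terms of $n!$ is a routine application of Stirling's approximation $n! = (1 + o(1))\sqrt{2\pi n}\,(n/e)^n$. The following worked step is a sketch of that standard calculation, supplied here for convenience rather than quoted from the cited paper.

```latex
\left((1+o(1))\frac{n}{e^{2}}\right)^{n}
  = (1+o(1))^{n}\, e^{-n} \left(\frac{n}{e}\right)^{n}
  = (1+o(1))^{n}\, e^{-n}\, \frac{n!}{(1+o(1))\sqrt{2\pi n}}
  = \left((1+o(1))\, e^{-1}\right)^{n} n!
```

The last equality uses that $\sqrt{2\pi n} = (1 + o(1))^n$. Numerically, $e^{-1} \approx 0.368$, which is indeed slightly below 0.39.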

The main contribution of the present paper is proving that this bound is asymptotically
tight for all $n$ in the case of Latin squares. As we will discuss in Section 4, it is
also tight for all $d$ and infinitely many values of $n$ in the case of general $d$-dimensional
Latin hypercubes.

Theorem 1.3. For every $n$,

$$T(n) \ge \left((1 - o(1)) \frac{n}{e^2}\right)^n.$$

Our paper is organized as follows. In Section 2 we present a short alternative proof of
the upper bound from Theorem 1.2. In Section 3 we give a probabilistic construction of
an order-$n$ Latin square with at least $\left((1 - o(1))\frac{n}{e^2}\right)^n$ transversals, proving Theorem 1.3.
Finally, in Section 4 we discuss possible generalizations of these results and related open
questions.


2. A short proof of Theorem 1.2

In this section, we present a short proof of Theorem 1.2 based on the entropy method.
This method has recently emerged as a powerful tool for a variety of counting problems.
It has been extensively used to obtain upper bounds (see, e.g., [?]), but it
has also been used for lower bounds, as in Johansson, Kahn and Vu's seminal paper [8]
on the threshold for matchings in hypergraphs.

The basic idea is as follows. In order to estimate the cardinality of a set $S$, we sample
an element $X \in S$ uniformly at random. Since $H(X) = \log(|S|)$, bounds on $H(X)$ translate
into bounds on $|S|$. This is of use because there are powerful information theoretic tools for
estimating entropies of random variables, and so estimating $H(X)$ is sometimes easier
than estimating $|S|$ directly.

Below we summarise the properties of the entropy function that we use. 

(i) The base $e$ entropy of a random variable $X$ is

$$H(X) = \sum_{x \in \mathrm{Range}(X)} \Pr(X = x) \log\left(\frac{1}{\Pr(X = x)}\right).$$

The entropy of a discrete random variable can be interpreted as being the amount
of information it encodes.

(ii) For any discrete random variable $X$, $H(X) \le \log(|\mathrm{Range}(X)|)$, with equality if
and only if $X$ is uniformly distributed.

(iii) $H(X | Y = y)$ is the entropy of $X$ conditioned on the event $Y = y$. The conditional
entropy $H(X | Y)$ is

$$H(X | Y) = \sum_{y \in \mathrm{Range}(Y)} \Pr(Y = y) H(X | Y = y) = \mathbb{E}_Y[H(X | Y = y)].$$

(iv) The chain rule: If $X = (X_1, \dots, X_n)$ then for any ordering of $[n]$,

$$H(X) = \sum_{i=1}^{n} H(X_i | X_j \text{ such that } j \text{ precedes } i).$$
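Properties (i) to (iv) can be checked numerically on a toy example. The sketch below is an illustration only: it takes $X$ to be a uniformly random permutation of four points (an assumption made for the example), computes $H(X)$ directly, and compares it with the chain-rule sum obtained by exposing the coordinates in their natural order. The helper names are hypothetical.

```python
import math
from collections import Counter
from itertools import permutations

def entropy(outcomes):
    """Base-e entropy of the distribution that picks uniformly from the list
    `outcomes` (repeated entries carry proportionally more probability)."""
    total = len(outcomes)
    return sum((c / total) * math.log(total / c) for c in Counter(outcomes).values())

def conditional_entropy(pairs):
    """H(X | Y) for a list of equally likely (y, x) pairs, following property (iii):
    average the entropy of X over the values y of Y."""
    total = len(pairs)
    by_y = {}
    for y, x in pairs:
        by_y.setdefault(y, []).append(x)
    return sum((len(xs) / total) * entropy(xs) for xs in by_y.values())

# X = (X_0, X_1, X_2, X_3) is a uniformly random permutation of {0, 1, 2, 3}.
S = list(permutations(range(4)))
h_joint = entropy(S)  # property (ii): equals log|S| because X is uniform

# Chain rule (iv): sum over i of H(X_i | X_0, ..., X_{i-1}).
h_chain = sum(conditional_entropy([(p[:i], p[i]) for p in S]) for i in range(4))

print(h_joint, h_chain, math.log(len(S)))
```

The proof below uses the same decomposition, except that the order of exposure is chosen at random and the conditional entropies are bounded rather than computed.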

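Let $A$ be an order-$n$ $d$-dimensional Latin hypercube in 0-1 representation, and let
$X$ be a transversal of $A$ selected uniformly at random, so that $e^{H(X)} = T(A)$ is the number of
transversals in $A$. Let $X_i$ denote the element of $A$ chosen for $X$ from the hyperplane
$A_i := A(i, \cdot, \dots, \cdot)$. We choose a random ordering of these variables by selecting a
number $\alpha_i \in [0, 1]$ uniformly at random for each such hyperplane, and let $\alpha = (\alpha_1, \dots, \alpha_n)$.
The variables $X_i$ are exposed in order of decreasing $\alpha_i$. Now,

$$\begin{aligned}
\log(T(A)) = H(X)
&= \mathbb{E}_\alpha\left[\sum_{i=1}^{n} H(X_i | X_j \text{ such that } \alpha_j > \alpha_i)\right] \\
&= \mathbb{E}_\alpha\left[\sum_{i=1}^{n} \mathbb{E}_{x_j : \alpha_j > \alpha_i}\left[H(X_i | X_j = x_j \text{ for } j \text{ such that } \alpha_j > \alpha_i)\right]\right] \\
&= \mathbb{E}_\alpha\left[\sum_{i=1}^{n} \mathbb{E}_X\left[H(X_i | X_j = x_j \text{ for } j \text{ such that } \alpha_j > \alpha_i)\right]\right] \\
&= \sum_{i=1}^{n} \mathbb{E}_X\left[\mathbb{E}_\alpha\left[H(X_i | X_j = x_j \text{ for } j \text{ such that } \alpha_j > \alpha_i)\right]\right], \tag{1}
\end{aligned}$$

where the last two equalities hold by basic properties of expectation.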

We say that two elements $(i_1, \dots, i_{d+1})$ and $(j_1, \dots, j_{d+1})$ share an index if there is
a $k$ such that $i_k = j_k$. Given previous variables, $X_i$ cannot choose an element that
belongs to the same hyperplane as a previously seen variable. A 1-element in $A_i$ is legal
if it does not share an index with a previously seen $X_j$. Let $N_i = N_i(\alpha, X)$ be the
number of legal elements in $A_i$.

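We cannot say very much about $X_i$'s distribution given previously observed variables,
but $X_i$ certainly must choose a legal element and therefore $|\mathrm{Range}(X_i)| \le N_i$. From
property (ii) and by applying Jensen's inequality we obtain

$$\begin{aligned}
\mathbb{E}_\alpha\left[H(X_i | X_j = x_j \text{ for } j \text{ such that } \alpha_j > \alpha_i)\right]
&\le \mathbb{E}_\alpha[\log(N_i)] \\
&= \mathbb{E}_{\alpha_i}\left[\mathbb{E}_{\alpha_j : j \ne i}[\log(N_i)]\right] \\
&\le \mathbb{E}_{\alpha_i}\left[\log\left(\mathbb{E}_{\alpha_j : j \ne i}[N_i]\right)\right]. \tag{2}
\end{aligned}$$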

The next step is to compute $\mathbb{E}_{\alpha_j : j \ne i}[N_i]$ using linearity of expectation. Conditioning
on $X$ and $\alpha_i$, what is the probability that a 1-element of the $i$-th hyperplane is legal?

There is always one 1-element, the one chosen by $X_i$, that is legal for $X_i$ regardless
of the ordering. To compute the probability that a different 1-element of the $i$-th
hyperplane is legal, we must count the number of different variables $X_j$ that rule out
that element for $X_i$.

Here we have to be a little careful. In the two dimensional case, every 1-element in
$A_i$ except $X_i$ is ruled out by exactly two variables. In the general $d$-dimensional case,
every 1-element is ruled out by at most $d$ variables. Our task is to show that a typical
1-element is ruled out by exactly $d$ variables.
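For Latin squares (the two dimensional case) the notion of a legal element is easy to compute directly. The sketch below is an illustration under assumed conventions: the square is stored as a list of rows with 0-based symbols, the hyperplane $A_i$ is row $i$, a transversal is given by the column `sigma[r]` chosen in each row, and the function name `legal_count` is hypothetical.

```python
def legal_count(L, sigma, alpha, i):
    """Number N_i of legal 1-elements in row i of the Latin square L.

    L      -- Latin square as a list of rows
    sigma  -- a transversal, given as the column sigma[r] chosen in row r
    alpha  -- exposure times; X_r is seen before X_i when alpha[r] > alpha[i]
    """
    n = len(L)
    seen = [r for r in range(n) if alpha[r] > alpha[i]]
    used_cols = {sigma[r] for r in seen}
    used_syms = {L[r][sigma[r]] for r in seen}
    # The 1-elements of the hyperplane A_i are the triples (i, j, L[i][j]).
    # Such a triple is legal if it shares neither its column nor its symbol
    # with a previously seen X_r (the row index i is never shared).
    return sum(
        1 for j in range(n) if j not in used_cols and L[i][j] not in used_syms
    )

n = 5
L = [[(r + c) % n for c in range(n)] for r in range(n)]  # cyclic square of order 5
sigma = list(range(n))  # main diagonal; its symbols 2r mod 5 are all different

first = legal_count(L, sigma, [0.9, 0.1, 0.2, 0.3, 0.4], 0)   # row 0 exposed first
second = legal_count(L, sigma, [0.5, 0.9, 0.1, 0.2, 0.3], 0)  # only row 1 seen before
last = legal_count(L, sigma, [0.05, 0.1, 0.2, 0.3, 0.4], 0)   # row 0 exposed last
print(first, second, last)
```

When row 0 is exposed first every entry of the row is legal, and when it is exposed last only the entry chosen by the transversal survives, in line with the discussion above.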

Let $V$ denote the set of 1-elements in $A_i$. Let $U$ denote the set of 1-elements of $A_i$
that share at least two indices with an element of $X$. The following claim provides an
upper bound on the size of $U$.

Claim 2.1. For the set $U$ as described above, $|U| \le d^2 \cdot n^{d-2}$.

Proof. Let $X_i = (i, j_1, \dots, j_d)$. For every $k \in [d]$, there are exactly $n^{d-2}$ 1-elements of $A_i$
that share $j_k$ with $X_i$. This accounts for at most $d \cdot n^{d-2}$ 1-elements of $U$.

The second possibility is that a 1-element of $A_i$ shares two or more indices with some
$X_{i'} = (i', j'_1, \dots, j'_d)$ for $i' \ne i$. If we fix two coordinates $m, \ell \in [d]$, there are exactly $n^{d-3}$
1-elements of $A_i$ that share the indices $j'_m$ and $j'_\ell$ with $X_{i'}$. There are $n - 1$ possibilities
to choose the index $i'$ and $\binom{d}{2}$ possibilities for $m$ and $\ell$, so this accounts for at most
$(n - 1)\binom{d}{2} n^{d-3}$ elements of $U$. Summing up the two values, the desired upper bound
follows. □

The next claim states that for every element in $V \setminus U$, there exist exactly $d$ elements
of $X$ that rule it out.

Claim 2.2. For every $v = (i, j_1, \dots, j_d) \in V \setminus U$, there exist exactly $d$ indices $i_1, \dots, i_d$,
all different from each other and from $i$, such that $X_{i_k}$ rules out $v$.

Proof. Fix a coordinate $k \in [d]$. Since $X$ is a transversal, there is a unique tuple
$X_{i_k} = (i_k, j_1^{(k)}, \dots, j_d^{(k)})$ such that $j_k^{(k)} = j_k$ and $X(i_k, j_1^{(k)}, \dots, j_d^{(k)}) = 1$. Thus, if $X_{i_k}$
precedes $X_i$ then by the time we see $X_i$ we already know of a 1-element of $X$ that
shares an index with $v$, and therefore $v$ is ruled out.

Note that $i_k \ne i$ because otherwise $v$ would share at least two indices, $i = i_k$ and
$j_k = j_k^{(k)}$, with $X_i$. Since $v \notin U$, $v$ can share at most one index with an element of $X$.
Similarly, $i_m \ne i_\ell$ for all distinct $m, \ell \in [d]$ because otherwise, $i_m = i_\ell$ implies
$(i_m, j_1^{(m)}, \dots, j_d^{(m)}) = (i_\ell, j_1^{(\ell)}, \dots, j_d^{(\ell)})$, which means that $v$ shares at least two indices,
$j_m = j_m^{(m)}$ and $j_\ell = j_\ell^{(\ell)} = j_\ell^{(m)}$, with $(i_m, j_1^{(m)}, \dots, j_d^{(m)})$. This contradiction
provides the statement of the claim. □


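Summarizing Claim 2.1 and Claim 2.2 we obtain

$$\begin{aligned}
\mathbb{E}_{\alpha_j : j \ne i}[N_i] &= \sum_{v \in V} \Pr(v \text{ is legal}) \\
&= \sum_{v \in U} \Pr(v \text{ is legal}) + \sum_{v \in V \setminus U} \Pr(v \text{ is legal}) \\
&\le |U| + |V \setminus U| \, \alpha_i^d \\
&\le d^2 \cdot n^{d-2} + \left(1 - \frac{d^2}{n}\right) n^{d-1} \alpha_i^d, \tag{3}
\end{aligned}$$

where the factor $\alpha_i^d$ in the second summand is the probability that for all $d$ indices
guaranteed by Claim 2.2 the value of $\alpha$ is smaller than $\alpha_i$, and therefore a particular
element $v \in V \setminus U$ precedes all $d$ elements that could have ruled it out.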

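Now we are ready to put together the statements (1), (2), and (3):

$$\begin{aligned}
\log(T(A)) &\le \sum_{i=1}^{n} \mathbb{E}_X\left[\mathbb{E}_{\alpha_i}\left[\log\left(d^2 \cdot n^{d-2} + \left(1 - \frac{d^2}{n}\right) n^{d-1} \alpha_i^d\right)\right]\right] \\
&\le n \int_0^1 \log\left(d^2 \cdot n^{d-2} + \left(1 - \frac{d^2}{n}\right) n^{d-1} \alpha_i^d\right) d\alpha_i \\
&= n \left(\log(n^{d-1}) - d + o(1)\right).
\end{aligned}$$

Thus,

$$T(A) \le \left((1 + o(1)) \frac{n^{d-1}}{e^d}\right)^n,$$

proving the theorem.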
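The last equality rests on $\int_0^1 \log(n^{d-1} \alpha^d)\, d\alpha = \log(n^{d-1}) - d$ together with the fact that the additive term $d^2 n^{d-2}$ only contributes $o(1)$. The following numerical sketch (a plain midpoint rule, with hypothetical function names and an arbitrarily chosen number of steps) illustrates how the gap between the integral and the main term shrinks as $n$ grows for $d = 2$.

```python
import math

def entropy_integral(n, d, steps=100_000):
    """Midpoint-rule estimate of the integral over [0, 1] of
    log(d^2 n^(d-2) + (1 - d^2/n) n^(d-1) a^d) with respect to a."""
    h = 1.0 / steps
    total = 0.0
    for s in range(steps):
        a = (s + 0.5) * h
        total += math.log(
            d**2 * n ** (d - 2) + (1 - d**2 / n) * n ** (d - 1) * a**d
        )
    return total * h

def gap(n, d):
    """Integral minus the claimed main term log(n^(d-1)) - d."""
    return entropy_integral(n, d) - ((d - 1) * math.log(n) - d)

for n in (10**2, 10**4, 10**6):
    print(n, gap(n, 2))
```

This is only a sanity check of the asymptotics for one value of $d$, not a substitute for the estimate in the proof.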

3. Proof of Theorem 1.3

In this section, we present a probabilistic construction of a Latin square $L' = L'(N')$
of order $N'$ for every sufficiently large integer $N'$, such that the number of transversals
in $L'$ is $\left((1 - o(1))\frac{N'}{e^2}\right)^{N'}$.

In what follows, we think of an order-n Latin square L as an n x n matrix over an 
alphabet A of cardinality n. The index pairs of L are referred to as positions, and 
elements of the alphabet are the symbols of L. 

Let us choose an integer $b \ge 3$ such that with $n = [\ldots]$, $k = [\ldots]$, and $N = n^2 k$,
we obtain $N \le N' - 3$ as large as possible. Furthermore, denote $\ell = n - bk$. Our plan
is as follows: first we construct a (probabilistic) Latin square $L$ of order $N$ with many
transversals. Then in the next step, we (deterministically) construct the desired
order-$N'$ Latin square $L'$ from $L$, with $N'$ between $N + 3$ and $N + \ell n k = N + \Theta([\ldots])$,
without decreasing the number of transversals significantly.

For the construction of $L$, we first take an order-$n$ Latin square $S$ over the alphabet
$[n]$ that will define the structure of $L$. We require $S$ to have a collection of $\ell$ disjoint
transversals. This is easy to achieve, as for every $n \ge 7$ there exists a pair of orthogonal
Latin squares (see, e.g., [13]), and so each of them has a decomposition into disjoint
transversals.

Let $A_1, \dots, A_n$ be a partition of $[N]$ into $n$ alphabets, each of size $nk$. Let $S_1, \dots, S_n$
be $n$ order-$(nk)$ Latin squares where the symbols of $S_i$ belong to the alphabet $A_i$, and
each of these squares has a decomposition into disjoint transversals. The existence of
such squares was established in the previous paragraph.

Our desired Latin square $L$ is a block matrix, where each block is an order-$(nk)$
Latin square that corresponds to a position of $S$. To construct $L$, for every position
$(i, j) \in [n]^2$, we replace the symbol $s = S(i, j)$ in $S$ with a Latin square over the
alphabet $A_s$.

Fix $\ell$ disjoint transversals in $S$, and call their positions special. For every $i \in [n]$,
we replace every symbol $i$ in a special position with the square $S_i$, and every symbol
$i$ in a non-special position with a random Latin square over the alphabet $A_i$, chosen
independently and uniformly at random from the set of such Latin squares. We also
call the positions of $L$ from the subsquares corresponding to special positions of $S$
special, and these subsquares themselves are also referred to as special subsquares. In
the future, whenever we refer to subsquares of $L$, we mean the Latin squares that
we used to replace symbols of $S$.
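The block substitution just described can be sketched in a few lines. The code below is a toy illustration under several assumptions that are not in the paper: the structure square $S$ is the cyclic square of order 5, the block order `m` (playing the role of $nk$) is 3, the fixed squares $S_i$ are cyclic, and the "random" non-special blocks are cyclic squares with shuffled rows, columns and symbols, which is a stand-in and not a uniformly random Latin square. All function names are hypothetical.

```python
import random

def is_latin(M):
    """True if every row and every column of M contains each symbol exactly once."""
    n = len(M)
    symbols = set(M[0])
    return (
        len(symbols) == n
        and all(set(row) == symbols for row in M)
        and all({M[r][c] for r in range(n)} == symbols for c in range(n))
    )

def cyclic_square(m, alphabet, rng=None):
    """Cyclic Latin square of order m over `alphabet`. If rng is given, rows,
    columns and symbols are shuffled (a stand-in for a random Latin square)."""
    rows, cols, syms = list(range(m)), list(range(m)), list(alphabet)
    if rng is not None:
        rng.shuffle(rows)
        rng.shuffle(cols)
        rng.shuffle(syms)
    return [[syms[(r + c) % m] for c in cols] for r in rows]

def blow_up(S, m, special, rng):
    """Replace every symbol s of S by an order-m square over the alphabet A_s."""
    n = len(S)
    alphabets = [range(s * m, (s + 1) * m) for s in range(n)]
    fixed = [cyclic_square(m, alphabets[s]) for s in range(n)]  # the squares S_i
    L = [[None] * (n * m) for _ in range(n * m)]
    for i in range(n):
        for j in range(n):
            s = S[i][j]
            if (i, j) in special:
                block = fixed[s]
            else:
                block = cyclic_square(m, alphabets[s], rng)
            for r in range(m):
                for c in range(m):
                    L[i * m + r][j * m + c] = block[r][c]
    return L

n, m = 5, 3
S = [[(i + j) % n for j in range(n)] for i in range(n)]
# Two disjoint transversals of S (broken diagonals) serve as the special positions.
special = {(i, (i + t) % n) for i in range(n) for t in range(2)}
L = blow_up(S, m, special, random.Random(0))
print(len(L), is_latin(L))
```

The result is a Latin square for any choice of blocks, because the blocks in a given row or column of $S$ use pairwise disjoint alphabets.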

We need the following two statements about L. The first statement is deterministic, 
as it only deals with special positions of L. We will only use it to construct the slightly 
larger Latin square L' from L. 

Observation 3.1. The Latin square $L$ contains a collection of $\ell n k$ disjoint transversals
containing only special positions.

The second statement gives a probabilistic lower bound for the actual count of
transversals in $L$, and later in $L'$.

Lemma 3.2. The Latin square $L$ has in expectation at least $\left((1 - o(1))\frac{N}{e^2}\right)^N$ transversals
not containing any special positions.

Proof. We consider only transversals that contain exactly $b$ positions from every
non-special subsquare of $L$.

First, we bound the number of “potential non-special transversals”, or in other
words permutation matrices containing $b$ positions from every non-special subsquare.
To choose the $b$ positions in the first non-special subsquare, we have $kn(kn - 1) \cdots (kn - b + 1)$
possibilities to determine the rows one after another, and the same number of possibilities
for the columns, and we should divide the product by $b!$, since we are not interested in the
order in which the $b$ positions are chosen. We end up with $(kn(kn - 1) \cdots (kn - b + 1))^2 / b!$ choices.

Similarly, dealing with non-special subsquares one by one and choosing all $b$ positions
in each of them before continuing to the next subsquare, assume that we arrive at a
(non-special) subsquare such that we already have chosen $b$ positions from $i$ subsquares
in the same row and $j$ subsquares from the same column of $S$. To choose $b$ positions
in the current subsquare, we need to choose $b$ rows out of $kn - ib$ possible rows, and $b$
columns out of $kn - jb$ possible columns, and combine them to get $b$ positions. This
results in a total number of

$$((kn - ib) \cdots (kn - ib - b + 1)) \cdot ((kn - jb) \cdots (kn - jb - b + 1)) / b!$$

possible choices.

Notice crucially that independently of the order in which we are dealing with the
subsquares, for every $i < n - \ell$, there will be exactly $n$ non-special subsquares (one in
each row of $S$) for which we dealt with exactly $i$ subsquares in the same row of $S$ before,
and similarly exactly $n$ times we will deal with a non-special subsquare for which we dealt
with exactly $i$ subsquares in the same column of $S$ before. Since $b(n - \ell) = kn$, the row
factors and the column factors each multiply to $((kn)!)^n$, and we end up with a total number
of

$$\frac{((kn)!)^{2n}}{(b!)^{n(n-\ell)}} \ge \frac{(kn)^{2n^2k}}{e^{2n^2k}} \cdot \frac{e^{bn(n-\ell)}}{b^{bn(n-\ell)}} \cdot (1 - o(1))^N = \left((1 - o(1)) \frac{N}{e}\right)^N$$

such permutation matrices.
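The claim that the count does not depend on the processing order can be verified exactly for small parameters. In the sketch below the values $n = 4$, $k = 1$, $b = 2$, $\ell = 2$ are toy choices (they satisfy $b(n - \ell) = kn$ but are far smaller than anything the construction would use), the layout of non-special cells is an assumed one with $n - \ell$ cells in every row and column of $S$, and the function names are hypothetical.

```python
import math
import random
from fractions import Fraction

def falling(x, b):
    """The falling factorial x (x - 1) ... (x - b + 1)."""
    out = 1
    for t in range(b):
        out *= x - t
    return out

def count_potential(n, k, b, ell, rng):
    """Multiply the per-subsquare numbers of choices in a random processing order."""
    assert b * (n - ell) == k * n
    # Assumed layout: n - ell non-special cells in every row and column of S.
    cells = [(i, (i + t) % n) for i in range(n) for t in range(n - ell)]
    rng.shuffle(cells)
    done_in_row = [0] * n
    done_in_col = [0] * n
    total = Fraction(1)
    for i, j in cells:
        rows = falling(k * n - done_in_row[i] * b, b)
        cols = falling(k * n - done_in_col[j] * b, b)
        total *= Fraction(rows * cols, math.factorial(b))
        done_in_row[i] += 1
        done_in_col[j] += 1
    return total

n, k, b, ell = 4, 1, 2, 2  # toy values, not the parameters of the construction
closed_form = Fraction(
    math.factorial(k * n) ** (2 * n), math.factorial(b) ** (n * (n - ell))
)
results = {count_potential(n, k, b, ell, random.Random(seed)) for seed in range(5)}
print(results == {closed_form})
```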

Now we need to estimate the probability $P$ for an arbitrary such permutation matrix
to correspond to an actual transversal in $L$. Notice that we are not aiming for the best
possible estimations, but just for those that will suffice for our purposes. Since we
already made sure that there is exactly one position from every row and every column,
there are just two bad events left:

• $E_1$: Two ones of the permutation matrix from different subsquares of $L$ correspond
to the same symbol.

• $E_2$: Two ones of the permutation matrix from the same subsquare of $L$ correspond
to the same symbol.

We interpret the random process of creating $L$ from $S$ as follows. We replace non-special
entries of $S$ by subsquares one-by-one, and check whether $E_1$ or $E_2$ occurred,
after every replacement.
We consider $E_2$ first. Formally, we have $b$ fixed positions, draw a random Latin
square over a given alphabet, and are wondering whether in any two of the chosen
positions, the symbols are the same. But since permuting the rows and columns of a
random Latin square does not change the probability space, we can assume that we
are given an arbitrary Latin square, and choose $b$ random positions in it from different
rows and columns. Notice that it is not important now what choices we made in other
subsquares, neither the positions (their rows and columns) nor the symbols there! For
a fixed non-special subsquare of $L$ that we are now looking at, assume that we already
chose some number $0 \le i < b$ of positions in different rows and different columns,
and all corresponding symbols were different - and we are now choosing a random
$(i + 1)$-st position from the remaining $(kn - i) \times (kn - i)$-square. To estimate the
desired probability, observe that every symbol from one of the already chosen positions
occurs at most $kn - i$ times in the remaining $(kn - i) \times (kn - i)$-square. Therefore,
the probability that the symbol in the $(i + 1)$-st chosen position is different from the
symbols in the previously chosen $i$ positions is at least $1 - \frac{i}{kn - i} = 1 - o(1)$. Hence, the
probability that the symbols we chose from the above considered subsquare of $L$ were
pairwise distinct is at least $(1 - o(1))^b$.

For the probability of $(\overline{E_1} \mid \overline{E_2})$, assume that we dealt with $0 \le j < n - \ell$ subsquares
over the same alphabet and chose $b$ positions with distinct symbols from each of
them, and are now dealing with the next subsquare (over the same alphabet). Furthermore,
since we can also permute the symbols of the alphabet without changing
the probability space of a random Latin square, we see that, conditioned on the event
that all $b$ chosen symbols were pairwise distinct, the probability that all of them were
different from the $jb$ previously chosen symbols from other subsquares over the same
alphabet is exactly

$$\binom{kn - jb}{b} \Big/ \binom{kn}{b}.$$

Thus, we obtain

$$P \ge e^{-o(N)} \left(\frac{(kn)!}{(kn)^{kn}}\right)^n = \frac{(1 - o(1))^N}{e^N}.$$
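Two ingredients of this estimate can be checked mechanically: for one alphabet, the product of the ratios $\binom{kn - jb}{b} / \binom{kn}{b}$ over $0 \le j < n - \ell$ is at least $(kn)!/(kn)^{kn}$, and by Stirling's formula $(m!/m^m)^{1/m}$ tends to $1/e$. The sketch below uses toy parameters with $b(n - \ell) = kn$ and hypothetical function names; it illustrates the inequalities and is not part of the proof.

```python
import math
from fractions import Fraction

def distinct_symbol_product(n, k, b, ell):
    """Product over j = 0, ..., n - ell - 1 of C(kn - jb, b) / C(kn, b),
    i.e. the probability of avoiding E_1 within a single alphabet."""
    assert b * (n - ell) == k * n
    out = Fraction(1)
    for j in range(n - ell):
        out *= Fraction(math.comb(k * n - j * b, b), math.comb(k * n, b))
    return out

def stirling_ratio(m):
    """e * (m! / m^m)^(1/m), computed via lgamma; it tends to 1 as m grows."""
    return math.exp(1 + (math.lgamma(m + 1) - m * math.log(m)) / m)

n, k, b, ell = 4, 1, 2, 2  # toy values, not the parameters of the construction
lhs = distinct_symbol_product(n, k, b, ell)
rhs = Fraction(math.factorial(k * n), (k * n) ** (k * n))
print(lhs, rhs, lhs >= rhs)
print(stirling_ratio(10**6))
```

Raising the single-alphabet bound to the power $n$ (one factor per alphabet) gives the term $\left((kn)!/(kn)^{kn}\right)^n$ in the displayed estimate.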

Therefore, the expected number of transversals in $L$ not containing any special position
is at least

$$\left((1 - o(1))\frac{N}{e}\right)^N \cdot \frac{(1 - o(1))^N}{e^N} = \left((1 - o(1))\frac{N}{e^2}\right)^N,$$

and the lemma follows. □

In particular, there exists an order-$N$ Latin square $L$ with at least $\left((1 - o(1))\frac{N}{e^2}\right)^N$
transversals not containing any special positions, and satisfying the statement of
Observation 3.1.

We are now ready to prove Theorem 1.3. Based on the Latin square $L$, we construct
a Latin square $L'$ of order $N'$.

Let us take an alphabet $A^*$ of size $s := N' - N$ that is disjoint from the alphabet
of $L$ - we assume w.l.o.g. that $A^* = [s]$. We put in the upper left $N \times N$ corner of $L'$
a copy of $L$, and in the bottom right $s \times s$ corner of $L'$ an arbitrary Latin square $L^*$
over the alphabet $A^*$ containing at least one transversal. (Notice that the existence of
such a Latin square $L^*$ is trivial since $s > 2$.) Finally, in $L'$, we replace $s$ of the special
transversals of $L$ by symbols from $A^*$ as follows. One after another, we take a symbol
$i \in A^*$ and a special transversal from $L$ that we have not dealt with yet. For every
position $(x, y)$ from this transversal, we put $i$ into the position $(x, y)$, and the symbol
that was originally in the position $(x, y)$ in $L$, we move into the positions $(x, N + i)$
and $(N + i, y)$ in $L'$. The resulting $L'$ is therefore a Latin square, and the number of
transversals in $L'$ is at least as large as the number of transversals in $L$ consisting only
of non-special positions, proving the theorem.
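The extension step can be sketched as follows. The example data are assumptions made for illustration: $L$ is the cyclic square of order 5, its broken diagonals play the role of the special transversals, $L^*$ is the cyclic square of order 3, and indices and symbols are 0-based, so the new symbols are $N, \dots, N + s - 1$ rather than a relabelled copy of $[s]$. The function names are hypothetical.

```python
def is_latin(M):
    """True if every row and every column of M contains each symbol exactly once."""
    n = len(M)
    symbols = set(M[0])
    return (
        len(symbols) == n
        and all(set(row) == symbols for row in M)
        and all({M[r][c] for r in range(n)} == symbols for c in range(n))
    )

def extend(L, transversals, L_star):
    """Build a square of order N + s from L, s disjoint transversals of L and an
    order-s Latin square L_star with symbols 0..s-1 (relabelled to N..N+s-1)."""
    N, s = len(L), len(transversals)
    Lp = [[None] * (N + s) for _ in range(N + s)]
    for x in range(N):
        for y in range(N):
            Lp[x][y] = L[x][y]
    for t, cells in enumerate(transversals):
        for x, y in cells:
            Lp[x][y] = N + t        # the new symbol replaces the old one
            Lp[x][N + t] = L[x][y]  # the old symbol moves to the new column
            Lp[N + t][y] = L[x][y]  # and to the new row
    for a in range(s):
        for c in range(s):
            Lp[N + a][N + c] = N + L_star[a][c]
    return Lp

N, s = 5, 3
L = [[(x + y) % N for y in range(N)] for x in range(N)]
transversals = [[(x, (x + t) % N) for x in range(N)] for t in range(s)]
L_star = [[(a + c) % s for c in range(s)] for a in range(s)]
L_prime = extend(L, transversals, L_star)

# A transversal of L that avoids the replaced cells, together with a transversal
# of L_star, gives a transversal of the extended square.
cells = [(x, (x + 3) % N) for x in range(N)] + [(N + a, N + a) for a in range(s)]
symbols_hit = {L_prime[x][y] for x, y in cells}
print(is_latin(L_prime), len(symbols_hit))
```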

The only condition on the relation between $N$ and $N'$ that needs to be satisfied is
$N' \in \{N + 3, \dots, N + \ell n k\}$ - and this is easy to achieve, since increasing $b$ by one in the
construction of $L$ increases $N$ by at most $[\ldots]$, and $\ell n k = \Theta([\ldots])$.

4. Concluding remarks and open questions 

Similarly to Theorem 1.3 and keeping in mind the upper bound from Theorem 1.2,
it is natural to consider the lower bound in the higher dimensional case. Analogously
to the proof of Lemma 3.2, choosing parameters $b$, $n$, $k$ as in that proof and $N = kn^d$,
we can prove the following.

Theorem 4.1. For every $d \ge 3$ and for infinitely many values of $N$,

$$T(d, N) \ge \left((1 - o(1)) \frac{N^{d-1}}{e^d}\right)^N.$$

We strongly believe that, similarly to Latin squares, this statement is true for every 
value of N. 

Another natural question concerns the distribution of $T(L)$. Our result implies that
for a certain collection of Latin squares, the number of transversals is $(1 - o(1))^n \cdot T(n)$.
It is natural to wonder whether the much more general statement is true, namely whether
$T(L) = (1 - o(1))^n \cdot T(n)$ for almost every order-$n$ Latin square $L$.

Finally, analogously to the question about $T(n)$, let $t(n)$ denote the minimum number
of transversals over all order-$n$ Latin squares. For even $n$ it is known that $t(n) = 0$.
As mentioned in the introduction, the most important open problem in the study of
transversals is Ryser's conjecture, which states that $t(n) > 0$ for odd $n$. Can we
strengthen the previous question about the distribution of $T(L)$ in the case of Latin
squares of odd order?

Question 4.2. Is it true that $t(n) = (1 - o(1))^n \cdot T(n)$ for odd $n$?